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Constant  Information  Model  can  be  used  when  we  substitute  a 
subset  of  equivalent,  binary  items,  whose  item  characteristic  functions 
are  unknoim,  for  the  •'Old  Test,**  or  a  subset  of  test  items  whose  operat¬ 
ing  characteristics  are  known,  in  estimating  the  operating  characteristics 
of  item  response*  In  so  doing,  it  has  been  suggested  that  we  choose  items 
whose  common  discrimination  power  is  low,  so  that  the  interval  of  latent 
trait,  for  which  their  common  item  characteristic  function  in  the  Constant 
Information  Model  assumes  positive  values,  is  wide  enough  to  cover  the 
ability  levels  of  all  the  examinees,'  on  which  the  operating  characteristics 
of  new  items  are  to  be  estimated. 

This  suggestion  needs  more  investigation  and  precision,  however, 
since  it  is  expected  from  theory  that  the  convergence  of  the  conditional 
distribution  of  the  maximum  likelihood  estimate,  given  latent  trait,  to 
the  normality  is  slow  for  the  values  of  latent  trait  which  are  close  to 
the  two  endpoints  of  the  above  interval,  in  comparison  with  those  close 
to  the  midpoint.  In  this  paper,  through  a  simulation  study,  the  speed 
of  convergence  of  the  conditional  distribution  of  the  maximum  likelihood 
estimate  to  the  normality  at  various  levels  of  latent  trait  is  observed, 
using  twenty  hypothetical  test  sessions,  in  each  of  which  ten  equivalent, 
binary  test  items  are  given.  As  was  expected,  the  conditional  distribution 
of  the  maximum  likelihood  estimate  is  skewed  for  the  values  of  latent 
trait  close  to  tlte  two  endpoints  of  the  interval,  and  the  convergence  is 
slower.  ,  From  these  results,  the  above  suggestion  is 
sion,  whtcli  is  essential  to  the  researclicrs  who  wish 
in  estimating  the  operating  characteristics. 
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Constant  Information  Model  can  be  used  when  we  substitute  a 
subset  of  equivalent,  binary  items,  whose  item  characteristic  functions 
are  unknown,  for  the  "Old  Test,"  or  a  subset  of  test  items  whose  operat¬ 
ing  characteristics  are  known,  in  estimating  the  operating  characteristics 
of  item  response.  In  so  doing,  it  has  been  suggested  that  we  choose  items 
whose  common  discrimination  power  is  low,  so  that  the  interval  of  latent 
trait,  for  which  their  common  item  characteristic  function  in  the  Constant 
Information  Model  assumes  positive  values,  is  wide  enough  to  cover  the 
ability  levels  of  all  the  examinees,  on  which  the  operating  characteristics 
of  new  items  are  to  be  estimated. 

This  suggestion  needs  more  investigation  and  precision,  however, 
since  it  is  expected  from  theory  that  the  convergence  of  the  conditional 
distribution  of  the  maximum  likelihood  estimate,  given  latent  trait,  to 
the  normality  is  slow  for  the  values  of  latent  trait  which  are  close  to 
the  two  endpoints  of  the  above  interval,  in  comparison  with  those  close 
to  the  midpoint.  In  this  paper,  through  a  simulation  study,  the  speed 
of  convergence  of  the  conditional  distribution  of  the  maximum  likelihood 
estimate  to  the  normality  at  various  levels  of  latent  trait  is  observed, 
using  twenty  hypothetical  test  sessions,  in  each  of  which  ten  equivalent, 
binary  test  items  are  given.  As  was  expected,  the  conditional  distribution 
of  the  maximum  likelihood  estimate  is  skewed  for  the  values  of  latent 
trait  close  to  the  two  endpoints  of  the  interval,  and  the  convergence  is 
slower.  From  these  results,  the  above  suggestion  is  added  by  more  preci¬ 
sion,  which  is  essential  to  the  researchers  who  wish  to  apply  this  method 
in  estimating  the  operating  characteristics. 
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I  Introduction 


Constant  Information  Model  on  the  dichotomous  response  level 
(Samejima,  1972)  of  latent  trait  theory  has  been  proposed  and  discussed 
(cf.  Samejima,  1979).  Let  9  be  a  unidimensional  latent  trait, 
whose  range  is  given  by 

(1.1)  <  6  <  00  ^ 

and  g  be  a  binary  item.  The  model  is  defined  by  the  item  character¬ 
istic  function,  »  such  that 

(1.2)  P^(0)  -  sln^[a  (0-b  )  +  (tt/4)  ]  , 

where  a  and  b  are  the  discrimination  and  difficulty  parameters, 

S  S 

respectively.  The  model  provides  us  with  such  an  item  information 

8  <  9  <  0 

otherwise  , 

(l.A)  0  -  [-TTa  ^/A)  +  b 

8  8 

and 

(1.5)  0  -  [Tia  “^/Al  +  b 

When  we  consider  a  subinterval  of  0  such  that 

(1.6)  9  <  0  <  0 


function,  I  (0)  ,  that 
g 


(1.3) 


Aa 

i_(e)  i  ® 

0 


g 


where 


-1 


.  w 


L 


as  Its  range,  the  item  characteristic  function  of  the  Constant  Inform¬ 
ation  Model  is  strictly  increasing  in  0  ,  with  r.ero  and  unity  as  its 

t%ro  asymptotes  -  the  characteristics  shared  by  the  normal  ogive 

model,  the  logistic  model,  and  so  on.  For  convenience,  hereafter, 
we  shall  call  the  set  of  models  which  share  this  particular  character¬ 
istic  Type  A  .  It  has  been  shown  iSamejima,  1979)  that  the  area 
under  the  curve  of  the  square  root  of  the  item  information  function 
equals  1T  regardless  of  the  model,  provided  that  it  belongs  to 
Type  A  . 

One  usefulness  of  the  Constant  Information  Model  is  that, 
by  virtue  of  the  transformat  ion- free  character  (Samejima,  1969) 
of  the  maximum  likelihood  estimate,  we  can  adopt  this  model  whenever 
we  locate  a  suitable  subset  of  equivalent,  binary  items  in  a  newly 
developed  item  pool,  and  use  this  subset  of  items  as  a  substitute 
for  the  set  of  items  whose  operating  characteristics  are  known 
(cf.  Samejima,  1979  ).  Thus  we  can  make  use  of  various  combinations 
of  a  method  and  an  approach  for  estimating  the  operating  characteris¬ 
tics  of  the  item  response  categories  (Samejima,  1977a,  1977b,  1978a, 
1978b,  1978c,  1978d,  1978e,  1978f)  without  depending  upon  the  subset 
of  a  priori  investigated  items,  or  the  Old  Test.  The  only  assump¬ 
tion  used  in  this  procedure  is  that  the  "true"  item  characteristic 
function,  which  is  common  to  these  equivalent,  binary  items,  belongs 
to  Type  A. 

In  the  actual  process  of  estimating  the  operating  character¬ 
istics,  a  combination  of  a  method  and  an  approach  is  selected  for 


use.  In  thtfsn  v  nuh Inat  tons  ot  a  motituil  snii  i»n  approach,  the 
asymptotic  normality  of  the  condttlonal  distrihntlon  ot  the  maximum 
llkellhooti  estimate,  0  ,  jtlven  0  ,  plays  an  important  role. 

Kor  the  test  ot  n  equivalent,  binary  items.  Constant 
Intormation  Model  provides  us  with  sucli  a  test  Intoimatlon  tunction 
that 

(1.7)  IfO)  -  sna  “  , 

tt 

for  the  Interval  of  0  qlven  bv  ll.o).  In  the  asymptotio  normal 
distribution  ot  tl\e  maximum  likelihood  estimate,  the  two  parameters, 
l.e..  the  conditional  nn'an  and  variance,  are  t*  and  the  Inverse 
of  the  test  information  tunction. 

Kst  imat  ion  ot  the  operatinjt  characlerist  Ics  of  tlu*  ^traded 
item  response  cateftv'rles  will  be  pursued  further,  bv  makint;  more 
use  ot  the  characteristics  of  the  polvnomlals  obtained  by  the 
method  of  nK'ments  fcf.  SameJ  iuwt  and  l.lvin>;ston ,  l^7'>),  and  so  forth. 

In  so  di'ln,i;,  the  role  of  the  Constant  lnform.atlon  Model  will  be 
important,  as  lonj;  as  we  wish  to  use  the  m«*thods  for  a  newly  developed 
item  pv'ol,  without  dependlnn  upon  a  set  ot  "known"  items,  v>r  the 
Old  Test.  One  prv''blem  we  should  seriously  Investljtate  before 
usiu);  the  Constant  Information  Model  for  a  subset  ot  "unknoVl^\," 
equivalent  items  v't  the  Item  pool  concerns  the  differences 
in  the  speed  of  converjtence  to  the  normality  caused  by  the  positions 
ot  0  in  the  Interval,  (0,  0)  (ct  .  .'tamejima,  l^i?'*).  In  the  present 
paper,  we  shall  pursue  this  subject  throujth  a  Monte  Carlo  studv. 


-A- 

11  Method  and  Data  Calibration 

For  the  coomon  item  characteristic  function  of  the  hypothetical 
equivalent,  binary  items.  Constant  Information  Model  with  the  parameters, 

{a  -  0.25 
* 

b^  -  0.00  , 

was  used.  The  interval  of  6  for  which  the  item  information  function 
assumes  a  positive  constant  is,  therefore,  given  from  (l.A),  (1.5)  and 
(1.6)  by 

(2.2)  -IT  <  e  <  IT  , 

and  we  have  for  the  amount  of  item  information 

(2.3)  I  (0)  -  0.25  . 

g 

As  the  fixed  levels  of  the  latent  trait  0  ,  eight  positions 

were  selected,  l.e.,  -3.0,  -2.2,  -1.4,  -0.6,  0.2,  1.0,  1.8  and  2.6  . 

A  group  of  one  hundred  hypothetical  examinees  were  assigned  to  each 

of  the  eight  levels  of  ability  6  ,  to  make  the  total  number  of 

hypothetical  examinees  eight  hundred.  There  were  twenty  hypothetical 

sessions  of  testing,  and  in  each  session  ten  equivalent,  binary  items 

were  administered.  An  item  score  u  (-  0  or  1)  was  calibrated  by 

g 

the  Monte  Carlo  method  following  the  Constant  Information  Model, 
whose  parameters  were  given  in  the  preceding  paragraph.  After  the 
completion  of  each  session,  the  (cumulative)  test  score  t  such  that 

■  Z  u  , 

u  cV  * 
g 


(2.4) 


t 


where  V  is  the  response  pattern  obtained  after  the  completion  of 
each  session,  or  the  vector  of  item  scores  to  the  items,  which  were 
administered  by  the  end  of  the  session,  was  computed  for  each  of 
Che  eight  hundred  hypothetical  examinees.  Thus  after  Che  completion 
of  the  k-th  session  the  full  test  score  is  10  x  k  .  The  maximum 
likelihood  estimate  6  was  obtained  by 

(2.5)  0  -  P'^U/dOk)] 

^  -1  1/2 
=  4  sin  ^{[t/(10k)]  -  ir 


for  each  hypothetical  subject,  after  the  completion  of  the  k-th  session 
(cf.  Samejima,  1979). 

Figure  2-1  illustrates  three  different  transformations  of  6  to 
,  T2  and  I  respectively,  through  the  formulae  such  that 

(2.6)  -  5.00  sin^[0.25e  +  (7r/4)  ]  -  2.50  , 

(2.7)  T,  -  P**"^  [sln^{0.25e  +  (tt/4)}]  , 

^  g 

where 


(2.8) 

and 


(2.9)  T3  -  (2/1.7)  log  (tan{(0.25e)  +  (n/A)}]  . 

They  are  the  three  transformations,  which  are  eventually  required  if 
the  true  item  characteristic  function  common  to  these  equivalent  items 


follows:  1)  the  linear  model  such  that 


FIGURE  2-1 


Transformations  of  0  to  t  Which  Result  from  Transforming  the  Constant 

Information  Model  with  a  •  0.25  and  b  ••  0.00  As  the  Parameters,  to 

g  g 

the  Linear  Model  with  a  -2.50  and  6  "2.50  (Broken  Curve),  to  the 

g  g 

Normal  Ogive  Model  with  a^  •  1.00  and  ■  0.00  (Solid  Curve),  and  to 

the  Logistic  Model  with  D  •  1.70  ,  a  *  1.00  and  b  “  0.00  (Dotted 

g  g 

Curve),  Respectively. 


(2.10) 


PMt  )  -  (t  -  a  )(6  -  a  )' 

Si  1  8  S  S 


for  u  <  T  <  , 

gig 


with  a 


'2.30  and  ii  ^  •  2.50  ,  2)  the  normal  ogive  model  such 


(2.11) 


P**(T,)  -  (2Tr)' 


jl  -  ^ 


exp(-u"/2]  du 


foi  -“<12*^'* 


with  a*  -  1.00  and  b*  -  0.00  ,  and  3)  the  logistic  model  such 

o  C> 


(2.12) 


'(Tj)  -  (1  +  exp{-l. 7a**(T ^-b**) } 1 


<  T  <  >“ 
3 


with  a**  -  1.00  and  b**  -  0.00  ,  respectively.  Table  2-1  presents 

o  8 

the  corresponding  values  of  ,  x,  and  X^  to  each  of  the  eight 
levels  of  0  ,  at  which  hypothetical  examinees  are  located. 


TABLE  2 'I 

Values  of  ,  T2  and  Corresponding  to  the  Eight  Levels 

of  6  ,  Which  Result  from  the  Transformations  from  the  Constant 

Information  Model  with  a  =•  0.25  and  b  =•  0.00  As  the 

g  g 

Parameters,  to  the  Linear  Model  with  a  =  -2.50  and 

g 

6  “  2.50  ,  to  the  Normal  Ogive  Model  with  a  =>1.00 

g  g 

and  ”  0.00  and  to  the  Logistic  Model  with  D  =•  1.70, 

a  *  1.00  and  b  =  0.00,  Respectively, 
g  g 


Constant 

Information 


Linear 

T, 


Normal 

Ogive 


Logistic 

T„ 


-3.0 

-2.494 

-3.023 

-3.930 

-2.2 

-2.228 

-1.604 

-1.680 

-1.4 

-1.611 

-0.923 

-0.900 

-0.6 

-0.739 

-0.379 

-0.358 

0.2 

0.250 

0.125 

0.118 

1.0 

1.199 

0.642 

0.614 

1.8 

1.958 

1.235 

1.240 

2.6 

2.409 

2.092 

2,345 

-9- 
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TabX*  3-1  presents  Che  sample  mean,  mg  ,  of  the  one  hundred 
maximum  likelihood  estimates  for  each  of  the  eight  groups  of  examinees, 
which  was  obtained  after  the  completion  of  each  of  the  twenty  sessions. 

It  is  observed  chat,  as  the  number  of  items  increases,  tends  to 

approach  Che  true  ability  d  on  each  ability  level.  And  yet  we  find 
substantial  differences  in  the  speed  of  convergence  across  the  different 
groups.  In  this  cable,  all  the  entries  whose  absolute  discrepancies 
from  the  true  9  are  greater  chan  or  equal  to  0.15  are  marked  with 
***,  chose  the  absolute  discrepancies  of  which  are  greater  than  or 
equal  to  0.10  and  less  Chun  0.15  are  marked  with  **,  and  those  whose 
absolute  discrepancies  are  greater  chan  or  equal  to  0.05  and  less  than 
0.05  are  marked  with  *  . 

We  can  see  from  Table  3-1  that,  for  Group  1,  whose  true  ability 
level  (■  -3.0)  is  very  close  to  the  lower  endpoint  0  (■  -n) ,  even 

after  200  items  were  administered,  the  discrepancy  of  mg  from  0  is 
as  large  as  0.061  .  In  contrast  to  this,  for  Group  5,  whose  ability 
level  is  0.2,  i.e.,  closest  of  all  the  eight  levels  to  Che  midpoint 
(-  0.0)  of  the  interval,  after  the  completion  of  the  first  se8.slon,  or 
the  administration  of  only  ten  items,  the  discrepancy  of  mg  from  9 
is  as  small  as  0.020  .  For  the  ocher  six  groups,  with  some  fluctuations, 
there  is  a  luminous  tendency  that,  as  the  true  ability  level  departs 
from  the  closer  end  point  of  tlie  Interval,  9  or  0  (“H),  the  conver¬ 

gence  of  mg  to  9  becomes  faster. 

Table  3-2  provides  us  with  the  sample  standard  deviation,  sg  , 


TABLE  3-1 

Sample  Mean  of  the  Maximum  Likelihood  Estimates  0  of  the  100  Hypothetical 
Examinees  of  Each  of  the  Eight  Ability  Levels,  after  Completing  Each  of  the 

Twenty  Sessions  of  Testing. 


Group 


Session' 


-3.0 


3.lJ$ 

3.1« 

3.l5g 

3.  its 

3.  if! 
** 
3.115 
** 

3.117 

3.11? 

3.10§ 

3.105 

3.1^2 

3.09^ 

3.08^ 

3.0?! 

3.07? 

3.07? 

3.06? 

3.06? 


-2.2 


-2.5liV 

-2.aVo' 

-2.3?2 

-2.3o! 

-2.26? 

-2.26? 

-2.241 

-2.233 

-2.227 

-2.232 

-2.220 

-2.211 

-2.218 

-2.212 

-2.209 

-2.198 

-2.197 

-2.191 

-2.192 

-2.194 


* 

-1.451 

-1.445 

-1.440 

-1.414 

-1.412 

-1.416 

-1.416 

-1.415 

-1.411 

-1.419 

-1.419 

-1.422 

-1.419 

-1.414 

-1.407 

-1.404 

-1.400 

-1.401 


■0.677 

•0.637 

0.625 

0.596 

0.566 

0.559 

0.571 

0.563 

0.577 

0.580 

0.583 

0.587 

0.598 

0.594 

0.596 

0.598 

0.599 

0.608 

0.612 

0.611 


5 

6 

0.2 

1.0 

0.219 

BBCZI 

0.214 

0.195 

1.045 

0.200 

1.042 

0.203 

1.046 

0.220 

1.037 

0.205 

1.024 

0.202 

1.025 

0.201 

1.025 

0.197 

1.024 

0.196 

1.017 

0.186 

1.019 

0.188 

1.018 

0.190 

1.017 

0.196 

1.015 

0.198 

1.013 

0.199 

1.016 

0.202 

1.017 

0.206 

1.020 

0.206 

1.020 

1.811 

1.811 

1.810 

1.789 

1.795 
1.794 
1.806 

1.796 
1.801 

1.799 
1.803 

1.803 
1.801 
1.802  I 
1.801 

1.804 
1.804  I 

1.800 


AAA 

2.816 

2.759 

** 

2.731 
** 

2.732 
** 

2.706 

** 

2.702 

2.696 

2.686 

2.670 

2.674 

2.668 

2.660 

2.649 

2.635 

2.639 

2.629 

2.615 

2.613 

2.615 
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TABLE  3-2 

Sample  Standard  Deviation  of  the  Maximum  Likelihood  Estimates  B  of  the 
100  Hypothetical  Examinees  of  Each  of  the  Eight  Ability  Levels,  after 
Completing  Each  of  the  Twenty  Sessions  of  Testing. 


Croup 

1 

2 

3 

4 

5 

6 

7 

u-L- 

IKO)!"^ 

Session^ 

-3.0 

-2.2 

PH 

-0.6 

0.2 

1.8 

2.6 

_ 

1 

0.68^ 

0.98%* 

0.632 

2 

*** 

0.000 

Mi 

MM 

Mill 

0.460 

A* 

0.592 

Hi 

0.447 

3 

** 

0.465 

0.415 

0.359 

0.403 

0.368 

** 

0,482 

0.410 

0.365 

A 

0.407 

0.330 

0.319 

0.328 

0.297 

0.383 

0.374 

0.316 

5 

KHct 

0.297 

0.285 

0.273 

0.280 

0.274 

0.321 

0.347 

0.283 

6 

** 

0.113 

0.270 

0.261 

0.267 

0.252 

0.235 

0.267 

0.310 

0.258 

7 

0.114 

0.226 

0.232 

0.251 

0.229 

0.216 

0.248 

0.286 

0.239 

8 

** 

0.107 

0.212 

0.225 

0.235 

0.219 

0.211 

0.222 

0.276 

0.224 

9 

A* 

0.100 

0.214 

0.222 

0.217 

0.203 

0.197 

0.216 

0.262 

0.211 

10 

0.102 

0.214 

0.216 

0.205 

0.198 

0.196 

0.203 

0.239 

0.200 

11 

0.109 

0.200 

0.202 

0.213 

0.187 

0.187 

0.184 

0.233 

0.191 

12 

0.1l8 

0.186 

0.191 

0.198 

0.174 

0.178 

0.175 

0.223 

0.183 

13 

0.115 

0.180 

0.178 

0.188 

0.162 

0.166 

0.173 

0.210 

0.175 

14 

O.llS 

0.171 

0.167 

0.173 

0.163 

0.162 

0.169 

0.208 

0.169 

15 

0.124 

0.165 

0.166 

0.172 

0.154 

0.157 

0.170 

0.189 

0.163 

16 

0.130 

0.163 

0.163 

0.164 

0.146 

0.157 

0.163 

0.187 

0.158 

17 

0.131 

0.159 

0.160 

0.157 

0.140 

0.150 

0.158 

0.169 

0.153 

18 

0.132 

0.152 

0.157 

0.153 

0.137 

0.146 

0.154 

0.143 

0.149 

19 

0.135 

0.149 

0.157 

0.153 

0.135 

0.142 

0.156 

0.138 

0.145 

20 

0.135 

0.141 

0.153 

0.155 

0.130 

0.137 

0.150 

0.121 

0.141 

A 

0.05 

< 

|sg  -  {I(0)}“^^^ 

AA 

0.10 

< 

|sg  - 

AAA 

0.15 

|sg  -  (1(0)}“^^^ 
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which  was  calculated  by  using  lOk  as  the  denominator  after  the  comple¬ 
tion  of  the  k-th  session,  of  the  one  hundred  maximum  likelihood  estimates 

for  each  group  after  the  completion  of  each  session.  In  the  same  table, 

-1/2 

the  values  of  (1(6)]  are  also  given  in  the  tightest  hand  column, 

for  the  purpose  of  comparison.  Some  of  the  entries  of  this  table 

are  marked  with  ***,  **  or  *  ,  following  the  same  rule  as  we  used  for 

the  sample  mean  mg  ,  in  accordance  with  the  absolute  discrepancy 

-1/2 

of  the  sg  from  11(0)]  .  We  observe  a  tendency  in  Table  3-2, 

which  is  very  similar  to  the  one  we  have  found  in  Table  3-1,  the  sample 

means. 

We  notice  that  for  Group  1,  whose  ability  level  is  very  close 
to  the  lower  endpoint  of  the  interval,  the  sample  standard  deviation 
sg  equals  zero  after  10,  20,  30  and  AO  items  have  been  administered, 
respectively.  This  owes  to  the  fact  that  the  response  patterns  of 
all  the  one  hundred  examinees  of  this  group  consist,  uniformly,  of 

■  0  ,  up  to  the  fourth  session,  and  the  likelihood  function  has, 
therefore,  the  terminal  maximum  at  0  -x  .  Since  we  have  from 
(1.2) 

(3.1)  Pg(-3.0)  -  sln^I(-3.0  +  it)/A] 

A  0.00125  , 

which  is  very  close  to  zero,  the  above  results  arc  understandable. 

Figure  3-1  presents  the  cumulative  frequency  ratios  of  the 
maximum  likelihood  estimates  of  the  one  hundred  examinees  of  Group  1 
(solid  line),  along  with  the  normal  distribution  function,  N(0,  1(0)  ^) 
(solid  curve),  after  the  completion  of  each  of  the  twenty  sessions. 


ill  M  laiHin 


Croup  1,  Session 


s  6  i  on 


-19- 


In  Che  same  figure,  also  presented  are  the  corresponding  normal 

2 

dlsCrlbuClon  functions  with  mg  and  sg  as  the  two  parameters 

(dotted  curves) .  These  normal  distribution  functions  coincide 

with  Che  corresponding  cumulative  frequency  ratios  in  the  first 

four  graphs  of  Figure  3-1,  i.e.,  for  sessions  1,  2,  3  and  4  , 

since  the  sample  variance  is  zero  in  these  four  cases.  The  dis- 

2 

crepancy  of  the  cumulative  frequency  ratio,  and  that  of  N(mg,  sg)  , 
from  the  other  normal  distribution  function,  N(6,  1(6)  ^)  ,  are 
substantially  large  even  in  the  last  graph  of  Figure  3-1,  where 
the  maximum  likelihood  estimates  were  obtained  on  the  basis  of  Che 
responses  to  two  hundred  items.  It  is  also  worth  noting  that  the 
cumulative  frequency  ratio  of  each  of  the  twenty  graphs  of  Figure  3-1 
indicates  a  J-shape  curve  for  the  frequency  distribution  of  the  one 
hundred  maximum  likelihood  estimates,  the  result  which  confirms 
our  theoretical  anticipation  made  earlier  (cf.  Samejima,  1979). 

A  similar  set  of  twenty  graphs  as  Figure  3-1  was  made  for 
Group  8,  whose  ability  level  (=  2.6)  is  the  next  closest  to  one  of 
the  endpoints  of  the  interval,  (-it,  tt)  ,  and  is  presented  as  Figure  3-2. 
We  find  in  this  figure  that,  unlike  the  case  of  Group  1,  the  cumulative 
frequency  ratio  of  the  one  hundred  maximum  likelihood  estimates  shows 
a  good  convergence  to  the  normal  distribution  function,  N(0,  1(6)  ^) , 
in  each  of  the  last  three  or  four  graphs,  or  after  170  or  more  items 
have  been  administered  Co  the  examinees.  For  this  ability  level, 
although  a  J-shape  curve  for  the  frequency  distribution  of  the  one 
hundred  maximum  likelihood  estimates  is  indicated  by  the  cumulative 
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frequency  ratio  of  each  of  the  first  three  graphs,  following  the 
transition  stage  of  sessions  A  and  5,  It  Is  shifted  to  a  unimodal 
curve  in  later  graphs.  This  result  can  be  explained  by  the  fact 
that  the  ability  level  of  Group  8  departs  from  the  upper  endpoint  of 
the  Interval,  (-n ,  n) ,  by  approximately  0.5A2  ,  while  that  of  Group 
1  Is  greater  than  the  lower  endpoints  only  by  0.142  .  The  condi¬ 
tional  probability  of  failure  for  each  equivalent,  binary  Item  for 
Croup  8  Is  given  by 

(3.2)  1  -  P  (2.6)  -  coa^((2.6  +  tt)/41 

8 

4  0.01822  . 

This  value  Is  approximately  fifteen  times  as  large  as  the  conditional 
probability  of  success  for  each  item  for  Group  1,  which  was  given  as 
(3.1). 

Hie  convergence  of  the  cumulative  frequency  ratio  to  the 

normal  distribution  function,  N(0,  1(6)  ^)  ,  Is  much  faster  If  we 

shift  the  ability  level  one  more  step  closer  to  the  center  of  the 

Interval,  (-^,  tt)  .  This  can  be  seen  in  Figure  3-3,  which  presents 

the  similar  set  of  twenty  graphs  for  Group  2.  For  this  group  of 

one  hundred  examinees,  0  "  -2.2  ,  which  is  much  farther  from  the 

lower  endpoint  of  the  interval  than  -3.0  .  After  the  administration 

2 

of  70  Items,  the  two  normal  distribution  functions,  N(mg,  sg)  and 
N(0,  1(0)  ^)  ,  are  almost  overlapping  with  each  other  for  Group  2. 
This  Is  a  remarkable  Improvement,  if  wo  compare  tljis  result  with  the 
fact  that  it  took  I/O  Items  to  administer  for  Group  8,  for  which 
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6  •  2.b  ,  to  actileve  this  level  ot  closeness.  A  J-shape  treijuency 
dlst  i' thul  Ion  v)f  the  maximum  llkellhuod  estimate  is  indicated  by  the 
cumulative  frequency  ratio  only  in  the  first  graph  of  Figure  3-3,  i.e., 
after  ten  items  have  been  administered  to  the  examinees,  and,  after 
that,  unimodal  frequency  distributions  are  observed. 

The  remarkable  Improvement  in  the  speed  of  convergence  of  the 
cumulative  frequency  ratio  of  one  hundred  maximum  likelihood  estimates 
for  Group  2  in  comparison  with  Group  8  can  be  explained  from  the 
difference  between  the  conditional  probability  of  success  for  Group  2 
and  that  of  failure  for  Group  8  .  fhe  former  conditional  probability 
is  given  by 

(3.3)  P,(-2.2)  -  sitr((-2.2  +  :i)/41 

s 

-  0.U5440  , 

whereas  the  latter  conditional  probability  was  already  obtained  in 
(3.2),  which  is  approximately  one-third  of  P  (-2.2)  .  This  dltfer- 
ence  between  the  two  conditional  probabilities  indicates  that  the 
probability  with  which  the  examinee  of  Group  2  obtains  the  str, -tly 
decreasing  likelihood  function  witli  the  terminal  maximum  at  0  »  -ti 
is  much  smaller  than  the  one  with  which  the  examinee  of  Group  8 
obtains  the  strictly  increasing  likelihood  function  with  the  terminal 
ma.xlmum  at  0  ■  ti  ,  especially  when  the  number  of  items  is  svibst ant  ial  ly 
large . 

So  far  we  have  observed  the  results  vif  the  three  groups  of 
one  hundred  examinees  wtiose  ability  levels  are  close  to  one  ot  the 


w 
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endpolnts  of  the  interval,  (-ti,  ti)  .  In  contrast  to  those  results, 
we  shall  observe  a  similar  set  of  twenty  graphs  for  Group  5,  for 
which  6  “  0.2  ,  i.e.,  closest  to  the  center  of  the  interval.  Figure 
3-4  presents  these  results. 

We  can  see  in  this  figure  that,  even  in  the  first  graph,  or 

after  the  administration  of  only  ten  items,  the  two  normal  distribution 
2  -1 

functions,  N(m;5,  ss)  and  N(0,  1(6)  )  ,  are  very  close.  We  also 

0  0 

notice  that  there  are  more  varieties  of  different  values  of  the  maximum 
likelihood  estimate,  and  thus  the  cumulative  frequency  ratio  is  much 
smoother,  in  earlier  graphs,  in  comparison  with  those  for  the  other 
three  groups  we  observed  earlier.  There  is  no  doubt  that  the  speed 
of  convergence  of  the  distribution  of  the  maximum  likelihood  estimate 
to  the  normality  is  much  higher  on  this  level  of  6  . 

Similar  sets  of  twenty  graphs  for  the  remaining  four  groups. 
Groups  3,  4,  6  and  7  ,  are  presented  in  Appendix  I  as  Figures  A-1-1, 

A- 1-2,  A- 1-3  and  A-1-4,  respectively.  We  can  say  from  these  results 

that,  except  for  Group  7,  whose  ability  level  is  1.8  ,  the  speed  of 
convergence  of  the  cumulative  frequency  ratio  of  the  maximum  likelihood 
estimate  is  quite  high.  The  conditional  probability  of  success 
in  one  binary  item  for  each  of  these  four  groups  of  subjects  is  given 
In  Table  4-4  which  will  be  presented  in  the  next  section. 
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IV  Further  Observation  of  the  Results 


We  have  observed  the  sample  mean,  ,  and  the  sample  standard 

deviation,  ,  of  the  one  hundred  maximum  likelihood  estimates  of 

each  of  the  eight  groups  of  examinees,  after  the  completion  of  eaclt 
of  the  twenty  sessions  of  testing.  We  have  also  observed  the  cumula¬ 


tive  frequency  ratio  in  each  case.  In  comparison  witl>  ttie  normal 
distribution  function,  N(0,  1(0)  S  •  For  further  observation, 
hero  we  consider  the  skewness  index,  Cj^  ,  which  is  defined  by 


(4.1) 


r  N  ■  m  ^  s 

10  30  0 


where  m^^  is  the  third  sample  moment  about  the  mean,  given  by 

100  , 

(4.2)  ra^g  -  (1/100)  E  (6^  -  mg) 


Table  4-1  presents  the  value  of  this  sample  skewness  index,  c^g  . 
for  the  set  of  the  one  hundred  maximum  likelihood  estimates  of  each 
of  the  eight  groups  of  ex.nminees,  obtained  after  the  completion  of 
each  of  tlie  twenty  sessions.  From  (4.1),  it  is  obvious  that  a 
positive  value  in  the  table  indicates  a  positive  skewness,  and  a 
negative  value  means  a  negative  skewness.  In  this  table,  again 
the  three  marks,  ***,  **  and  *  ,  are  assigned  to  some  of  the 
entries.  The  first  mark  is  attached  to  all  the  values  of  c^g  , 
whose  absolute  values  are  greater  than  or  equal  to  1.5  ,  the  second 
one  is  for  those  the  absolute  values  of  whicit  are  greater  than  or  equal 
to  1.0  and  less  than  1.5  ,  and  the  lliird  one  is  assigned  to  those 
whose  absolute  values  are  greater  th.in  or  equal  to  0.5  and  less  than 


i 


TABLE  4-1 


1.0  .  We  notice  that  the  first  four  entries  of  the  column  of  Group 
1  in  the  table  are  indeterminate,  since  both  the  third  and  second 
moments  about  the  mean  are  zero  in  each  of  these  four  cases. 

It  is  interesting  to  note  that,  for  Group  2,  the  sign  of  the 
index  is  reversed  between  the  first  and  second  sessions,  and,  for 
Group  8,  it  is  reversed  between  the  fourth  and  fifth  sessions,  while 
it  is  consistently  positive  for  Group  1.  Unlike  Tables  3-1  and 
3-2,  which  were  presented  in  the  preceding  section  and  are  for  the 
sample  mean  and  the  sample  standard  deviation,  respectively,  the 
configuration  of  the  values  ot  the  sample  skewness  index  in  Table  4-1 
is  somewhat  more  complicated  and  seemingly  more  difficult  to  inter¬ 
pret. 

This  problem  of  interpretation  will  be  solved,  however,  if  we 
pay  attention  to  the  frequencies  of  the  two  extreme  values  of  the 
maximum  likelihood  estimate,  -n  and  it  .  Table  4-2  presents  the 
frequency  distribution  of  these  two  values  for  each  group  and  for 
each  session.  In  this  table,  the  numbers  appearing  in  the  first 
four  columns  are  the  freqencies  of  -n  ,  and  those  in  the  last  three 
columns  are  those  of  it  .  For  Group  5,  the  frequencies  are  zero  for 
both  -TT  and  tt  throughout  the  twt^nty  sessions. 

We  can  see  in  Table  4-2  that,  for  Group  1,  even  after  the 
administration  of  200  items,  the  terminal  maximum  for  the  likelihood 
function  at  the  lower  end  of  the  Interval  (-tt,  tt)  occurred  73 
times  out  of  100  .  This  explains  the  J-shape  cumulative  frequency 
ratios  of  the  maximum  likelihood  estimate  for  Group  1  throughout  the 


: 


-44- 


TABLE  4-2 

Frequencies  of  -n  As  the  Maximum  Likelihood  Estimate  Appearing  In 
the  First  4  Columns  and  Those  of  n  Appearing  In  the  Last  3  Columns. 
The  Total  Number  of  Maximum  Likelihood  Estimates  in  Each  Cell  is  100. 
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1 
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3 

B 

5 

6 

7 
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SesTlotT^ 

-3.0 

-2.2 

-0.6 

0.2 

1.0 

1.8 

2.6 

1 

100 

62 

D 
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0 

8 

37 

75 

2 

100 

35 

0 

0 

1 

11 

66 

3 

100 
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0 

0 

0 

0 

mm 

43 

5 

99 

H 

0 

0 

0 

0 

B 

39 

6 

95 

0 

0 

0 

0 

0 

30 

7 

94 

0 

0 

0 

0 

0 

0 

26 
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94 

0 
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0 

0 

0 

23 

9 

94 

0 
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0 

0 

0 

0 

20 

10 

93 

0 

0 

0 

0 

0 

0 

15 

11 

91 

0 

0 

0 

0 

0 

0 

14 

12 

90 

0 

0 

0 

0 

0 

0 

12 

13 

89 
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0 

0 

10 

14 

87 
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0 

0 

0 

0 

9 

15 

84 

0 

0 

0 

0 

0 

0 

6 

16 

80 

0 

0 

0 

0 

0 

0 

6 

17 

79 
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0 

0 

0 

4 

18 

78 

0 

0 

0 

0 

0 

0 

1 

19 

76 

0 

0 

0 

0 

0 

0 

1 

20 

73 

0 

0 

0 

0 

0 

0 

0 
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cwenty  sessions  of  testing,  as  we  have  observed  In  the  preceding 
section.  On  the  other  hand,  a  close  observation  of  the  relationship 
between  the  skewness  Indices  and  the  frequencies  of  tite  two  extreme 
values  of  the  maximum  likelihood  estimate  reveals  that,  for  Groups 
8,  2  and  7  ,  the  reversal  of  the  sign  of  the  skewness  Index  occurs 
somewhere  around  the  frequency  40  of  the  extreme  values  of  the  maximum 
likelihood  estimate.  Why  Is  It  so? 

To  answer  this  question,  we  must  call  our  attention  to  the 
fact  that  there  Is  a  substantial  gap  between  one  of  the  extreme 
values  of  the  maximum  likelihood  estimate  and  the  adjacent  one. 

To  be  more  specific,  for  Ses.sion  1,  or  after  the  administration  of 
10  Items,  the  value  of  the  maximum  likelihood  estimate  corresponding 
to  the  second  best  test  score,  9  ,  is  1.854b  ,  which  is  obtained 
through  (2.5)  with  k  •  1  .  Thus  the  difference  of  this  value  of 
the  maximum  likelihood  estimate  from  ’ll  is  as  large  as  1.2870  . 

The  same  amount  of  discrepancy  exists  between  and  the  adjacent 

maximum  likelihood  estimate,  -1.854b  ,  which  corresponds  to  the 
second  lowest  test  score,  1  .  Even  for  Session  20,  or  after  all 
the  200  items  have  been  administered,  tills  discrepancy  is  still  as 
large  as  0.2831  ,  since  the  maximum  likelihood  estimate  corresponding 
to  the  second  highest  test  score,  199  ,  is  2.8585  ,  and  the  one 
corresponding  to  the  second  lowest  test  score,  1  ,  is  -2.8585  . 

We  notice  that,  when  -tt  ,  one  of  the  two  extreme  values  of 


the  maximum  likelihood  estimate,  has  a  high  frequency,  it  makes  the 
total  frequency  distribution  of  the  maximum  likelihood  estimate 
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J-shaped,  causing  a  positive  skevmess  to  it.  On  the  other  hand, 
when  such  an  extreme  value  has  a  low  frequency,  because  of  the  long 
distance  from  the  adjacent  value  of  the  maximum  likelihood  estimate, 
it  creates  a  long  tail  on  the  left  hand  side  of  the  total  frequency 
distribution,  and,  therefore,  causes  a  negative  skewness  to  the 
frequency  distribution.  We  can  easily  see  that  the  same  logic 
applies  for  the  frequency  of  the  other  extreme  value,  ,  of  the 
maximum  likelihood  estimate,  and  negative  and  positive  skewnesses. 
Considering  these  facts,  we  are  in  a  good  position  to  understand 
the  meaning  of  the  total  configuration  of  the  skewness  indices 
in  Table  4-1.  It  should  also  be  pointed  out  that  those  which 
Involve  neither  of  the  two  extreme  values,  -tt  and  tt  ,  of  the 
maximum  likelihood  estimate  are  small  enough  to  be  considered  as 
sampling  fluctuations.  In  fact,  only  a  few  of  them  are  marked 
with  *  ,  and  none  of  the  others  exceeds  0.5  in  absolute  value. 

If  we  combine  the  result  in  Table  4-1  with  the  sample  means 
of  the  maximum  likelihood  estimate,  which  are  given  in  Table  3-1 
in  the  preceding  section,  the  effect  of  the  two  extreme  values 
of  the  maximum  likelihood  estimate  is  more  conspicuous.  We  notice 
that  all  the  sample  means,  which  Involve  one  or  more  -fr  or  tt  , 
are  shifted  from  the  true  values  of  0  to  the  directions  indicated 
by  the  signs  of  the  extreme  values.  It  is  also  noted  that  all  the 
sample  moans,  which  involve  neither  -tt  nor  it  ,  have  very  small 
discrepancies  from  the  true  values  of  0  ,  l.e.,  less  than  0.05 
in  absolute  value. 


Ill  I  1 1^  i  ~  r~  "i  ~ 


Table  4-3  presents  the  kurtosis  index,  c,,.  ,  which  is 

t  c 

de fined  by 

(4.3)  c^g  •  ra^g  sg  -  3  , 

where  is  the  fourth  sample  moment  of  the  one  Imndred  maximum 

likelihood  estimates,  for  which  we  can  write 

100  , 

(4.4)  m,fi  -  (1/100)  I  (8,  -  m^^)  , 

i-1  ^ 

for  each  of  the  eight  groups  of  examinees  after  the  completion  of 
each  of  the  twenty  sessions.  The  three  types  of  marks,  ***,  ** 
and  *  ,  are  assigned  in  this  table  by  following  the  same  rule 
as  we  have  used  for  the  skewness  index  in  Table  4-1.  As  is  well 
known,  this  kurtosis  index  is  defined  in  comparison  with  the  normal 
distribution,  i.e.,  a  positive  value  indicates  a  greater  peakedness, 
and  a  negative  value  a  lesser  peakedness,  than  the  normal  distri¬ 
bution. 

We  notice  that,  again  in  these  results,  there  exists  a  certain 
effect  of  the  extreme  values,  -n  and  Ti  ,  of  the  maximum  likelihood 
estimate.  In  fact,  we  observe  two  different  transitions  there, 
one  of  which  is  from  positive  to  negative  values,  and  the  other 
from  negative  to  positive  values.  The  former  transition  stage 
corresponds,  approximately,  to  the  frequency  75  of  -n  or  tt  , 
and  the  latter  transition  stage  corresponds,  roughly,  to  the  frequency 
13  .  Sampling  fluctuations  appear  to  be  greater  for  the  kurtosis 


Index  than  for  the  skewness  index,  the  fact  which  coincides  with 


TABLE  4-3 


Sample  Kurtosis  Index  of  tlie  Maximum  Likelihood  Estimates  0  of  the  100 
Hypothetical  Examinees  of  Each  of  the  Eight  Abilitv  Levels,  after 
Completing  Each  of  the  Twenty  Sessions  of  Testing. 
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0.115 
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1.0 

-0.499 
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0.375 

* 
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2.071 

-0.421 
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-0.077 

0.014 
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-0.015 
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0.183  I 

0.472  ! 

0.492 

0.867 
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0.858 

* 
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our  general  knowledge  of  greater  sampling  fluctuations  for  higher 
order  moments. 

Table  4-4  presents  the  conditional  probability  of  success, 

Pg(0)  .  for  a  single  item,  which  is  given  by  (1.2),  for  each  of 

the  eight  ability  levels,  together  with  the  item  information  function 

I  (0)  ,  and  three  other  item  information  functions  obtainable 
g 

after  the  transformation  of  6  to  and  through 

(2.6),  (2.7)  and  (2.9),  respectively.  These  item  information 

functions  are  obtained  by  multiplying  I  (6)  with  the  square  of 

g 

the  derivative  of  0  with  respect  to  the  corresponding  transformed 
variables,  respectively  (cf.  Samejima,  1979). 


TABl.K  4-A 
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V  lU s c iis s  i on  and  I'on o  1  u s li’ n_s 

Wo  have  touiui  out  tliat  t  lio  oonvo  i  i;onoi>  I'l  tin'  inmn  1  at  I  vo 

froqnoncy  dlst  rilnit  ion  ot  Cho  maximum  likollhood  I'Stimato  to  t  lio 

normal  distribution  function,  NfO,  1(01  S  .  is  subst ant ial Iv 
slower  for  the  levels  of  ability  close  to  one  of  tbe  two  endpoints 
of  tbe  interval,  (-n,  i')  ,  in  comparison  with  the  levels  ot  ability 
close  to  tbe  center  of  the  Interval.  It  has  become  obvious  tliat 
tbe  frequency  of  one  ot  tbe  two  extreme  values,  -ti  and  n  ,  of 
tbe  maximum  likeliltood  estimate  is  a  jtood  indicator  of  tbe  dejtree 
of  convergence,  since  it  affects,  stron>;ly,  tbe  moments  of  tbe 
freqviency  distribution.  One  import  atit  Implication  of  this  findinj; 

is  that  we  should  avoid  usiu);  tbe  normal  distribution,  N(0,  IfO)  S  , 

as  the  approximation  to  tbe  condition.il  distribution  of  tbe  m.ixlmum 
likelihood  estimate,  unless  there  are  a  lar^e  enough  number  of  items 
so  that  tbe  conditional  probability  v)f  tlie  "all  success"  response 
pattern,  and  that  of  tbe  "all  failuie"  response  pattern,  are  ne>;libly 
small. 

To  pursue  this  further,  the  conditional  probabilltv  of  the 
"all  success"  response  pattern,  and  tliat  of  the  "all  failure"  response 
pattern,  were  calcul.ited  from  1'  (dl  ,  wbiih  is  presenti'd  lor  e.uh 
of  tlie  elubt  different  levels  of  .il<illtv  in  Table  4-4  ot  the  precedin^t 
section.  Since  all  tlie  items  .ire  >'tiuivalent ,  these  conditional 
probabilities  are  r^(d)"  .imi  |l  -  l\(i')l"  ,  respectively.  Kor 
0  ••  -3.0  ,  or  the  ability  level  ot  I'.roup  1,  when  we  Increase  the 
number  of  items  to  Jib,  SIJ,  I, OJA,  J,04S  and  4,0db,  the  conditional 
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probablllty  of  the  "all  failure"  response  pattern  becomes  0.725, 

0.52b,  0.277,  0.077  and  0.006,  respectively.  Tills  means  that, 
even  if  more  than  1,000  items  are  administered,  we  must  expect 
approximately  28  examinees  out  of  100  will  obtain  the  "all  failure" 
response  pattern,  and  we  need  more  than  4,000  items  to  reduce  it 
to  0.6  person  out  of  100  examinees.  These  outcomes  are  exactly 
the  same  for  0  -  3.0  ,  when  we  consider  the  conditional  probability 
of  the  "all  success"  response  pattern.  We  may  conclude,  therefore, 
that  for  the  values  of  0  3.0  or  greater  in  absolute  value  we  should 
give  up,  totally,  the  idea  of  using  the  normal  approximation  to  the 
conditional  distribution  of  the  maximum  likelihood  estimate.  For 
0  “  2.6  ,  if  we  increase  the  number  of  items  from  32  to  64,  128,  256 
and  512,  the  conditional  probabilities  of  the  "all  success"  response 
pattern  are  0.555,  0.308,  0.095,  0.009  and  0.00008,  respectively. 

Thus  we  can  say  that,  for  this  level  of  ability,  if  we  use  approxim¬ 
ately  300  items,  chances  are  slim  for  one  person  out  of  100  examinees 
to  obtain  tt  as  the  maximum  likelihood  estimate.  This  number, 

300,  is  still  too  large  for  practical  purposes,  however,  and  we  must 
say  that  in  the  vicinity  of  0  ■  2.6,  and  also  that  of  0  •  -2.6, 
the  idea  of  using  the  normal  approximation  to  the  conditional  distri¬ 
bution  of  the  maximum  likelihood  estimate  is  unrealistic.  The 
situation  Is  much  more  ameliorated,  however,  if  we  switch  to  the 
ability  level,  0  -  -2.2  .  On  this  level  of  ability,  for  8,  16, 

32,  64  and  128  items,  the  conditional  probabilities  of  the  "all 
failure"  response  pattern  are  0.639,  0.409,  0.167,  0.028  and  0.0008  , 
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respectivoly .  Thus  wo  can  concluiio  that  ,  as  lonn  as  wo  liave  100 
itoms.  It  is  unlikoly  to  happen  that  one  person  out  of  one  hundred 
examinees  obtains  -n  as  his  maximum  likelihood  estimate.  This 
is  well  exemplified  and  supported  by  the  result  of  our  Monte  Carlo 
study,  which  is  Riven  in  Table  4-^  of  the  precedlnR  section.  In 
otlier  words,  we  can  be  assured  that  100  items  are  enough  as  far  as 
we  use  the  subinterval  of  0  ,  (-2.2,  2.2)  ,  for  which  the  normal 
approximation  to  the  conditional  distribution  of  the  maximum  likeli¬ 
hood  estimate  is  adopted.  Tliis  number  of  items  will  be  reduced 
to  half,  i.e.,  50  ,  if  we  switcli  the  sublnteival  of  0  from 
(-2.2,  2.2)  to  (-1.8,  1.8)  ,  since  for  0  -  1.8  the  conditional 
probabilities  of  the  "all  success"  response  pattern  are  0.832,  0.400, 
0.160,  0.025  and  0.0006  ,  for  4,  8,  16,  32  and  64  items,  respectively. 
If  we  need  to  use,  say,  only  35  itoms,  we  must  narrow  down  the  interval 
further  to  (-1.4,  1.4)  ,  for  the  conditional  probabilities  of  the 
"all  failure"  response  pattern  at  0  -  -1.4  ,  or  those  of  the  "all 
success"  response  pattern  at  0  «  1.4  ,  are  0.676,  0.457,  0.20‘), 

0.044  and  0.002  for  2,  4,  8,  16  and  12  items,  respect ivelv.  We 
notice  that  the  gain  in  the  reduction  of  tlie  number  of  items  is 
decreasing  compared  with  the  amount  of  s.icrifice  in  tlie  Interval 
length,  and  it  is  probably  meaningless  to  consider  further  reductions 
of  items.  If  we  wish  to  use  a  set  of  "unknown"  equivalent  items 
as  a  substitute  for  the  Old  Test,  it  is  advisable  to  aim  .it  including 
35  to  50  equivalent  items  in  the  new  item  pool.  This  can  be  done 


through  the  content  analysis  of  ttie  items,  and  their  proportions 


correct  In  the  preliminary  study,  etc.  As  an  additional  information, 
the  conditional  probabilities  of  the  "all  success"  response  pattern 
for  the  ability  levels  1.0,  0.6  and  0.2  ,  or  those  of  the  "all  failure" 
response  pattern  for  the  ability  levels  -1.0,  -0.6  and  -0.2  ,  are: 

0.547,  0.299,  0.090  and  0.008;  0.420,  0.176,  0.031  and  0.001;  and 
0.302,  0.091,  0.008  and  0.00007;  for  2,  4,  8  and  16  items,  respectively. 

The  conclusion  made  in  the  preceding  paragraph  is  supported 
by  the  result  of  our  Monte  Carlo  study,  which  was  made  in  earlier 
sections.  In  the  graph  of  Session  10  in  Figure  3-3,  that  of  Session 
5  in  Figure  A-1-4,  and  that  of  Session  4  in  Figure  A-1-1,  which 
Involve  100  items  for  9  “  -2.2  ,  50  items  for  6  ■  1.8  and  40  items 
for  0  “  -1.4  ,  respectively,  the  normal  distribution  function, 

N(0,  1(0)  and  the  cumulative  frequency  ratio  are  sufficiently 
close.  We  find  in  Tables  3-1,  3-2,  4-1  and  4-3  that  the  sample 
mean,  the  sample  standard  deviation,  the  sample  skewness  index  and 

-1/2 

the  sample  kurtosls  index  are:  -2.232  (  0  “  -2.200  ],  0.214  [  1(0) 

■  0.200  ),  -0.277  and  0.003  in  the  first  case,  1.810  [  0  -  1.800  1, 

-1/2 

0.321  (  1(0)  '  -  0.283  1,  0.762  and  2.071  in  the  second  case,  and 

-1.445  I  0  -  -1.400  ],  0.330  [  1(0)“^^^  -  0.316  ],  -0.431  and  0.940 
in  the  third  case,  respectively.  We  can  say  that,  at  least,  the 
sample  mean  and  the  sample  standard  deviation  are  very  close  to 
the  asymptotic  values  in  each  case. 
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